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Recommendation G.729 



CODING OF SPEECH AT 8 kbit/s USING CONJUGATE-STRUCTURE 
ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (CS-ACELP) 

(Geneva, 1996) 



1 Introduction 

This Recommendation contains the description of an algorithm for the coding of speech signals at 8 kbit/s using 
Conjugate-Structure Algebraic-Code-Excited Linear-Prediction (CS-ACELP). 

This coder is designed to operate with a digital signal obtained by first performing telephone bandwidth filtering 
(Recommendation G.712).of the analogue input signal, then sampling it at 8000 Hz, followed by conversion to 16-bit 
linear PCM for the input to the encoder. The output of the decoder should be converted back to an analogue signal by 
similar means. Other input/output characteristics, such as those specified by Recommendation G.71 1 for 64 kbit/s PCM 
data, should be converted to 16-bit linear PCM before encoding, or from 16-bit linear PCM to the appropriate format 
after decoding. The bitstream from the encoder to the decoder is defined within this Recommendation. 

This Recommendation is organized as follows: Clause 2 gives a general outline of the CS-ACELP algorithm. In 
clauses 3 and 4, the CS-ACELP encoder and decoder principles are discussed, respectively. Clause 5 describes the 
software that defines this coder in 16 bit fixed-point arithmetic. 



2 General description of the coder 

The CS-ACELP coder is based on the Code-Excited Linear-Prediction (CELP) coding model. The coder operates on 
speech frames of 10 ms corresponding to 80 samples at a sampling rate of 8000 samples per second. For every 10 ms 
frame, the speech signal is analysed to extract the parameters of the CELP model (linear-prediction filter coefficients, 
adaptive and fixed-codebook indices and gains). These parameters are encoded and transmitted. The bit allocation of the 
coder parameters is shown in Table 1. At the decoder, these parameters are used to retrieve the excitation and synthesis 
filter parameters. The speech is reconstructed by filtering this excitation through the short-term synthesis filter, as is 
shown in Figure 1. The short-term synthesis filter is based on a 10th order Linear Prediction (LP) filter. The long-term, 
or pitch synthesis filter is implemented using the so-called adaptive-codebook approach. After computing the 
reconstructed speech, it is further enhanced by a postfilter. 



TABLE 1/G.729 
Bit allocation of the 8 kbit/s CS-ACELP algorithm (10 ms frame) 



Parameter 


Codeword 


Subframe 1 


Subframe 2 


Total per frame 


Line spectrum pairs 


Zj0,Z.1,L2, L3 






18 


Adaptive-codebook delay 


P\, PI 


8 


5 


13 


Pitch-delay parity 


P0 


1 




1 


Fixed-codebook index 


C1,C2 


13 


13 


26 


Fixed-codebook sign 


S\,S2 


4 


4 


8 


Codebook gains (stage 1) 


GA\,GA2 


3 


3 


6 


Codebook gains (stage 2) 


GB\,GB2 


4 


4 


8 


Total 








80 



Recommendation G.729 (03/96) 1 



Excitation 




Long-term 
synthesis 
filter 




Short-term 
synthesis 
filter . . 


► 


Post 




codebook 


► 


► 




filter 





T 



Parameter decoding 

T 

I 

Received bitstream 



Output 
speech 



T1518640-95/d01 



FIGURE 1/G.729 
Block diagram of conceptual CELP synthesis model 



2.1 Encoder 

The encoding principle is shown in Figure 2. The input signal is high-pass filtered and scaled in the pre-processing 
block. The pre-processed signal serves as the input signal for all subsequent analysis. LP analysis is done once per 10 ms 
frame to compute the LP filter coefficients. These coefficients are converted to Line Spectrum Pairs (LSP) and quantized 
using predictive two-stage Vector Quantization (VQ) with 18 bits. The excitation signal is chosen by using an analysis- 
by-synthesis search procedure in which the error between the original and reconstructed speech is minimized according 
to a perceptually weighted distortion measure. This is done by filtering the error signal with a perceptual weighting filter, 
whose coefficients are derived from the unquantized LP filter. The amount of perceptual weighting is made adaptive to 
improve the performance for input signals with a flat frequency-response. 



The excitation parameters (fixed and adaptive-codebook parameters) are determined per subframe of 5 ms (40 samples) 
each. The quantized and unquantized LP filter coefficients are used for the second subframe, while in the first subframe. 
interpolated LP filter coefficients are used (both quantized and unquantized). An open-loop pitch delay is estimated once 
per 10 ms frame based on the perceptually weighted speech signal. Then the following operations are repeated for each 
subframe. The target signal x(n) is computed by filtering the LP residual through the weighted synthesis filter W(z)/A(z). 
The initial states of these filters are updated by filtering the error between LP residual and excitation. This is equivalent 
to the common approach of subtracting the zero-input response of the weighted synthesis filter from the weighted speech 
signal. The impulse response h(n) of the weighted synthesis filter is computed. Closed-loop pitch analysis is then done 
(to find the adaptive-codebook delay and gain), using the target x(n) and impulse response h(n\ by searching around the 
value of the open-loop pitch delay. A fractional pitch delay with 1/3 resolution is used. The pitch delay is encoded with 
8 bits in the first subframe and differentially encoded with 5 bits in the second subframe. The target signal x(n) is 
updated by subtracting the (filtered) adaptive-codebook contribution, and this new target, x'in), is used in the fixed- 
codebook search to find the optimum excitation. An algebraic codebook with 17 bits is used for the fixed-codebook 
excitation. The gains of the adaptive and fixed-codebook contributions are vector quantized with 7 bits, (with MA 
prediction applied to the fixed-codebook gain). Finally, the filter memories are updated using the determined excitation 
signal. 
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FIGURE 2/G.729 
Encoding principle of the CS-ACELP encoder 



2.2 Decoder 

The decoder principle is shown in Figure 3. First, the parameter's indices are extracted from the received bitstream. 
These indices are decoded to obtain the coder parameters corresponding to a 10 ms speech frame. These parameters are 
the LSP coefficients, the two fractional pitch delays, the two fixed-codebook vectors, and the two sets of adaptive and 
fixed-codebook gains. The LSP coefficients are interpolated and converted to LP filter coefficients for each subframe. 
Then, for each 5 ms subframe the following steps are done: 

• the excitation is constructed by adding the adaptive and fixed-codebook vectors scaled by their respective 
gains; 

• the speech is reconstructed by filtering the excitation through the LP synthesis filter; 

• the reconstructed speech signal is passed through a post-processing stage, which includes an adaptive 
postfilter based on the long-term and short-term synthesis filters, followed by a high-pass filter and 
scaling operation. 
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FIGURE 3/G.729 
Principle of the CS-ACELP decoder 



2.3 Delay 

This coder encodes speech and other audio signals with 10 ms frames. In addition, there is a look-ahead of 5 ms, 
resulting in a total algorithmic delay of 15 ms. All additional delays in a practical implementation of this coder are due 
to: 

• processing time needed for encoding and decoding operations; 

• transmission time on the communication link; 

• multiplexing delay when combining audio data with other data. 

2.4 Speech coder description 

The description of the speech coding algorithm of this Recommendation is made in terms of bit-exact, fixed-point 
mathematical operations. The ANSI C code indicated in clause 5, which constitutes an integral part of this 
Recommendation, reflects this bit-exact, fixed-point descriptive approach. The mathematical descriptions of the encoder 
(clause 3), and decoder (clause 4), can be implemented in several other fashions, possibly leading to a codec 
implementation not complying with this Recommendation. Therefore, the algorithm description of the ANSI C code of 
clause 5 shall take precedence over the mathematical descriptions of clauses 3 and 4 whenever discrepancies are found. 
A non-exhaustive set of test signals, which can be used with ANSI C code, are available from the ITU. 

2.5 Notational conventions 

Throughout this Recommendation, it is tried to maintain the following notational conventions: 

• Codebooks are denoted by caligraphic characters (e.g. C). 

• Time signals are denoted by their symbol and a sample index between parenthesis [e.g. s(n)]. The symbol 
n is used as sample index. 

• Superscript indices between parenthesis (e.g. g< m) are used to indicate time-dependency of variables. The 
variable m refers, depending on the context, to either a frame or subframe index, and die variable. n to a 
sample index. 

• Recursion indices are identified by a superscript between square brackets (e.g. E^). 

• Subscripts indices identify a particular element in a coefficient array. 

• The symbol A identifies a quantized version of a parameter (e.g. g c ). 

• Parameter ranges are given between square brackets, and include the boundaries (e.g. [0.6, 0.9]). 
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• The function log denotes a logarithm with base 10. 

• The function int denotes truncation to its integer value. 

• The decimal floating-point numbers used are rounded versions of the values used in the 16 bit fixed-point 
ANSI C implementation. 

. Table 2 lists the most relevant symbols used throughout. mis Recommendation. A glossary of the most relevant signals is 
given in Table 3. Table 4 summarizes relevant variables and their dimension. Constant parameters are listed in Table 5. 
The acronyms used in this Recommendation are summarized in Table 6. 



TABLE 2/G.729 



Glossary of most relevant symbols 



Name 


Reference 


Description 


1/4(2) 


Equation (2) 


LP synthesis filter 


"Alt) 


Equation (1) 


Input high-pass filter 


H p {z) 


Equation (78) 


Long-term postfilter 


Hjiz) 


Equation (84) 


Short-term postfilter 


H£z) 


Equation (86) 


Tilt-compensation filter 




Equation (91) 


Output high-pass filter 


PU) 


Equation (46) 


Pre-filter for fixed codebook 


W(z) 


Equation (27) 


Weighting filter 
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Glossary of most relevant signals 



Name 


Reference 


Description 


<:(*) 


3.8 


Fixed-codebook contribution 


d(n) 


3.8.1 


Correlation between target signal and h(n) 


ew(n) 


3.10 


Error signal 


h(n) 


3.5 


Impulse response of weighting and synthesis filters 


rin) 


3.6 


Residual signal 


**) 


3.1 


Pre-processed speech signal 


s(n) 


4.1.6 


Reconstructed speech signal | 


s'(n) 


3.2.1 


Windowed speech signal 




4.2 


Postfiltered output 


sfXn) 


4.2 . 


Gain-scaled postfiltered output 


sw(n) 


3.6 


Weighted speech signal 


xin) 


3.6 


Target signal 


x'(n) . 


3.8.1 


Second target signal 


u(n) 


3.10 


Excitation to LP synthesis filter 


K«) 


3.7.1 


Adaptive-codebook contribution 


>in) 


3.7.3 


Convolution v(n)_* h(n) 


zM 


3.9 


Convolution c(n) * h(n) 
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TABLE 4/G.729 
Glossary of most relevant variables 



Name 


Size 


Description 


8 P 


1 


Adaptive-codebook gain j 


8c • 


1 


Fixed-codebook gain 


81 


1 


Gain term for long-term postfilter 


Sf 


. 1 


Gain term for short-term postfilter 


8t 


1 


Gain term for tilt postfilter 


G 


1 


Gain for gain normalization 


T 

*op 


1 


Open-loop pitch delay 




11 


LP coefficients (oq = 1.0) 


*/ 


10 


Reflection coefficients 


*1 


1 


Reflection coefficient for tilt postfilter 


Of 


2 


LAR coefficients 


CO- 

I 


10 


LSF normalized frequencies 


A 

p.,; 


40 


MA predictor for LSF quantization 




10 


LSP coefficients 


rik) 


11 


Auto-correlation coefficients 


r'(k) 


11 


Modified auto-correlation coefficients 




10 


LSP weighting coefficients 


A 

h 


10 


LSP quantizer output 



TABLE 5/G.729 
Glossary of most relevant constants 



Name 


Value 


Description 


fs 


8000 


Sampling frequency 


' fo 


60 


Bandwidth expansion 


Yi 


0.94/0.98 


Weight factor perceptual weighting filter 


Y2 


0.60/[0.4-0.7] 


Weight factor perceptual weighting filter 


In 


0.55 


Weight factor postfilter 


Id 


0.70 


Weight factor postfilter \ 


V 


0.50 


Weight factor pitch postfilter 




0.90/0.2 


Weight factor tilt postfilter 


e 


Table 7 


Fixed (algebraic) codebook 


L0 


3.2.4 


Moving-average predictor codebook 


LI 


3.2.4 


First stage LSP codebook 


L2 


3.2.4 


Second stage LSP codebook (low part) 


L3 


3.2.4 


Second stage LSP codebook (high part) 


bA 


3.9 


Gain codebook (first stage) 


b(S 


3.9 


Gain codebook (second stage) 




Equation (6) 


Correlation lag window 


w lp 


Equation (3) 


LP analysis window 
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TABLE. 6/G.729 
Glossary of acronyms 



Acronym 


Description 


CELP 


Code-Excited Linear-Prediction 


CS-ACELP 


Conjugate-Structure Algebraic-CELP 


MA 


Moving Average 


MSB 


Most Significant Bit 


MSE 


Mean-Squared Error 


LAR 


Log Area Ratio 


LP 


Linear Prediction 


LSP 


Line Spectral Pair 


LSF 


Line Spectral Frequency 


VQ 


Vector quantization 



3 Functional description of the encoder 

In this clause the different functions of the encoder represented in the blocks of Figure 2 are described. A detailed signal 
flow is shown in Figure 4. 

3.1 Pre-processing 

As stated in clause 2, the input to the speech encoder is assumed to be a 16 bit PCM signal. Two pre-processing 
functions are applied before the encoding process: 

1 ) signal scaling; and 

2) high-pass filtering. 

The scaling consists of dividing the input by a factor 2 to reduce the possibility of overflows in the fixed-point 
implementation. The high-pass filter serves as a precaution against undesired low-frequency components. A second 
order pole/zero filter with a cut-off frequency of 140 Hz is used. Both the scaling and high-pass filtering are combined 
by dividing the coefficients at the numerator of this filter by 2. The resulting filter is given by: 

_ 0.46363718 - 0.92724705Z' 1 + 0.463637 18z- 2 (1) 
hl{Z) 1 - 1.9059465z- 1 + 0.9114024z~ 2 

The input signal filtered through H hX (z) is referred to as s(n\ and will be used in all subsequent coder operations. 

3.2 Linear prediction analysis and quantization 

The short-term analysis and synthesis filters are based on 10th order Linear Prediction (LP) filters. 
The LP synthesis filter is defined as: 

_L = i , (2) 

where d h i = 1,...,10, are the (quantized) Linear Prediction (LP) coefficients. Short-term prediction, or linear prediction 
analysis is performed once per speech frame using the autocorrelation method with a 30 ms asymmetric window. Every 
80 samples (10 ms), the autocorrelation coefficients of windowed speech are computed and converted to the LP 
coefficients using the Levinson algorithm. Then the LP coefficients are transformed to the LSP domain for quantization 
and interpolation purposes. The interpolated quantized and unquantized filters are converted back to the LP filter 
coefficients (to construct the synthesis and weighting filters for each subframe). 
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3.2.1 Windowing and autocorrelation computation 



The LP analysis window consists of two parts: the first part is half a Hamming window and the second part is a quarter 
of a cosine function cycle. The window is given by: 



wi p (n) = < 



0.54 - 0.46 cos n = 0,..., 

(2k (n - 200)\ 
COS l 159 J 



199 

n = 200,...,239 



(3) 



There is a 5 ms lookahead in the LP analysis which means that 40 samples are needed from the future speech frame. This 
translates into an extra algorithmic delay of 5 ms at the encoder stage. The LP analysis window applies to 120 samples 
from past speech frames, 80 samples from the present speech frame, and 40 samples from the future frame. The 
windowing procedure is illustrated in Figure 5. 



LP windows 



I I I I Y///X//A I k\\\t\\\S I 1 Subframes 

T1518680-95/d05 



FIGURE 5/G.729 
Windowing procedure in LP analysis 



The different shading patterns identify corresponding excitation and LP analysis windows. 
The windowed speech: 

s'in) = wi p (n)s(n) n = 0 239 (4) 



is used to compute the autocorrelation coefficients: 



239 

r(k) = £ V(n - k) k = 0,...,10 
n=k 



(5) 



To avoid arithmetic problems for low-level input signals the value of rip) has a lower boundary of r(0) = 1.0. A 60 Hz 
bandwidth expansion is applied, by multiplying the autocorrelation coefficients with: 



wiagQQ = exp 



k = 1 10 



(6) 
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where /o = 60 Hz is the bandwidth expansion and f s = 8000 Hz is the sampling frequency. Furthermore, r(Q) is multiplied 
by a white-noise correction factor 1.0001, which is equivalent to adding a noise floor at -40 dB. The modified 
autocorrelation coefficients are given by: 

r'(0) = 1.0001 r(0) (?) 
r'(k) = w hg (k)r(k) k = 1.....10 

3.2.2 Levinson-Durbin algorithm 

The modified autocorrelation coefficients r*(k) are used to obtain the LP filter coefficients, a iy i - 1, — ,10, by solving the 
set of equations: 

10 

Ya/r'fl/ - k\) = -r'(t) k = 1 10 (8) 



The set of equations in (8) is solved using the Levinson-Durbin algorithm. This algorithm uses the following recursion: 

£t°l = r'(0) 
fori = 1 to 10 

4r ,] = r 



for j = 1 to / - 1 

flW = a?" 11 + kia { !'- 
J J 1 *-J 

end 

m = (i - *2) #-n 



The final solution is given as. fly = dj l0 \j = 0, :..,10, with a$= 1.0. 
3.23 LP to LSP conversion 

The LP filter coefficients a§ 9 i = 0,...10 are converted to Line Spectral Pair (LSP) coefficients for quantization and 
interpolation purposes. For a 10th order LP filter, the LSP coefficients are defined as the roots of the sum and difference 
polynomials: 

F,'( Z ) = A(z) + r"A(r l ) (9) 

and: 

F' 2 {z) = A(z) - r n A(r l ) (10) 
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respectively. The polynomial F[(z) is symmetric, and F^z) is antisymmetric. It can be proven that all roots of these 
polynomials are on the unit circle and they alternate each other. F[(z) has a root z = -1 (to = tt) and F^z) has a root z = 1 
(w = 0). These two roots are eliminated by defining the new polynomials: 

F,(z) = F|(z)/(1 + z-l) (11) 

and: 

F 2 (z) = Ffc)/(l - z" 1 ) (12) 

Each polynomial has five conjugate roots on the unit circle (e^i), and they can be written as: 

F\(z) = II (1 - ^r 1 + z' 2 ) (13) 
i=l,3,...,9 

and: 

^> = lid - Msr x + *~ 2 ) O 4 ) 

/ = 2,4,..., 10 

where ^ = cos(o)/). The coefficients o>; are the Line Spectral Frequencies (LSF) and they satisfy the ordering property 0 
< 0)/ < o>2 < ... < o>io < n. The coefficients q t are referred to as the LSP coefficients in the cosine domain. 

Since both polynomials F\(z) and F 2 (z) are symmetric only the first five coefficients of each polynominal need to be 
computed. The coefficients of these polynomials are found by the recursive relations: 

f x (i + 1) = aio-/ -/i(0 i = 0,...,4 

/ 2 (i + 1) = fl/+i- flio-i +/2(0 « = 0,...,4 

where /i(0) = /2(0) = 1.0. The LSP coefficients are found by evaluating the polynomials F\(z) and F 2 (z) at 60 points 
equally spaced between 0 and n and checking for sign changes. A sign change signifies the existence of a root and the 
sign change interval is then divided four times to allow better tracking of the root The Chebyshev polynomials are used 
to evaluate F { (z) and F 2 (z). In this method the roots are found directly in the cosine domain. The polynomials F\(z) or 
F 2 (z), evaluated at z = e?®, can be written as: 

F(co) = 2e'i^C{x) (16) 

with: 

C{x) = T 5 (x) + fi\)T 4 (x) + fi2)T 3 (x)+AW 2 ix) + MWiV) + AW . (17) 

where T m {x) = cos(mco) is the /nth order Chebyshev polynomial, arid /(/), i = 1,...,5, are the coefficients of either F\(z) or 
F 2 (z), computed using Equation (15). The polynomial C(x) is evaluated at a certain value of x = cos(co) using the 
recursive relation: 

for k = 4 down to I 

b k = 2xb k +\ - ** + 2 +X5 - t) 

em/ 

C(jc) = xb\ - b 2 + /(5)/2 
with initial values 65 = 1 and b& = 0. 
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3.2.4 Quantization of the LSP coefficients 



The LSP coefficients q x are quantized using the LSF representation co, in the normalized frequency domain [0, tc]; that is: 

co/ = arccosO?,-) i = 1,...,10 (18) 



A switched 4th order MA prediction is used to predict the LSF coefficients of the current frame. The difference between 
the computed and predicted coefficients is quantized using a two-stage vector quantizer. The first stage is a 
1 0-dimensional VQ using codebook Ll with 128 entries (7 bits). The second stage is a 10 bit VQ which has been 
implemented as a split VQ using two 5-dimensional codebooks, L2 and L3 containing 32 entries (5 bits) each. 

To explain the quantization process, it is convenient to first describe the decoding process. Each coefficient is obtained 
from the sum of two codebooks: 



A L1/(L1) + L2/(L2) i = 1,...,5 

h = I (19) 
[L1/(L1) + L3/- 5 tf3) i = 6,..,10 



where Ll, L2 and L3 are the codebook indices. To avoid sharp resonances in the quantized LP synthesis filter, the 
coefficients ?/ are arranged such that adjacent coefficients have a minimum distance of J. The rearrangement routine is 
shown below: 

fori = 2,...,10 

</(?/-!>?/- J) 

%- X = (ti + ?/-!- J)/2 
?i = (ti + + J)I2 

end 

end 



This rearrangement process is done twice. First with a value of J = 0.0012, then with a value of J = 0.0006! After this 
rearrangement process, the quantized LSF coefficients of J"^ for the current frame w, are obtained from the weighted sum 
of previous quantizer outputs t^ m ~ k \ and the current quantizer output t^: 



1 



*=1 



4 



IW + lPi^" 1 -® ,/» WO (20) 



where p it * are the coefficients of the switched MA predictor. Which MA predictor to use is defined by a separate bit L0. 
At start-up the initial values of art given by? / = /7C/llforalU<0. 

After computing co,-, the corresponding filter is checked for stability. This is done as follows: 

1) order the coefficient co,- in increasing value; 

2) if co,. < 0,005 then co,- = 0.005; 

3) if co/+i -co/ -0.0391 men co l + i = co/ + 0.0391, i = l 9; 

4) if co 10 > 3.135 then co 10 = 3.135. 
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The procedure for encoding the LSF parameters can be outlined as follows. For each of the two MA predictors the best 
approximation to the current LSF coefficients has to be found. The best approximation is defined as the one that 
minimizes the weighted mean-squared error: 



10 



(21) 



The weights Wj are made adaptive as a function of the unquantized LSF coefficients, 



w; = 



1.0 



(f t02 - 0.04n - 1 > 0 



10 (0)2 - 0.047C - l) 2 + 1 otherwise 



wi 2 < / < 9 = 



|1.0 

Il0(0), + , - (Of., -1)2+1 



if(Oi+\ - co, _ i - 1 > 0 
otherwise 



(22) 



r 

10 (-CO9 + 0.9271 - 1) 2 +1 



Z/-CD9 + 0.9271 - 1 > 0 
otherwise 



In addition, the weights vv 5 and w 6 are multiplied by 1.2 each. 
The vector to be quantized for the current frame m is obtained from 











4 \ 






i 












i * 


~ 1 ) 



i = 1 10 



(23) 



The first codebook LI is searched and the entry LI that minimizes the (unweighted) mean-squared error is selected. This 
is followed by a search of the second codebook L2, which defines the lower part of the second stage. For each possible 
candidate, the partial vector co ; , i = 1,...,5, is reconstructed using Equation (20), and rearranged to guarantee a minimum 
distance of 0.0012. The weighted MSE of Equation (21) is computed, and the vector L2 which results in the lowest error 
is selected. Using the selected first stage vector LI and the lower part of the second stage L2, the higher part of the 
second stage is searched from codebook L3. Again the rearrangement procedure is used to guarantee a minimum 
distance of 0.0012. The vector L3 that minimizes the weighted MSE is selected. The resulting vector ?/, / = 1,...,10 is 
rearranged to guarantee a minimum distance of 0.0006. This process is done for each of the two MA predictors defined 
by L0, and the MA predictor L0 that produces the lowest weighted MSE is selected. As was explained at the beginning 
of this clause, the resulting vector ?/ is rearranged twice and a stability check is applied to produce the quantized LSF 
coefficients oV-. 



3.2.5 Interpolation of the LSP coefficients 

The quantized (and unquantized) LP coefficients are used for the second subframe. For the first subframe, the quantized 
(and unquantized) LP coefficients are obtained by linear interpolation of the corresponding parameters in the adjacent 
subframes. The interpolation is done on the LSP coefficients in the cosine domain. Let qjcurrent) ^ me i£j> coefficients 
computed for the current 10 ms frame, and q<P revwus ) the LSP coefficients computed in the previous 10 ms frame. The 
(unquantized) interpolated LSP coefficients in each of the two subframes are given by: 



Subframe 1: = 0.5g^ revious ^ + Q.5q\ currenS ^ i = 'l 10 

Subframe!: qf ] = q^urrem) t = x w 



(24) 
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The same interpolation procedure is used for the interpolation of the quantized LSP coefficients by substituting q x by q x > 
in Equation (24). 

3.2.6 LSP to LP conversion 

Once the LSP coefficients are quantized and interpolated, they are converted back to the LP coefficients a x . This 
' conversion is done as follows. The coefficients of F x (z) and F 2 (z) are found by expanding Equations (13) and (14) 
knowing the quantized and interpolated LSP coefficients. The coefficients i = 1,...,5, are computed from q, using 
the recursive relation: 

for i = 1 to 5 

f X {i) = -2q 2i -\f\{i - D + 2/K/-2) 
for j = i - 1 down to 1 

f\ n Ql=f\ i -' ] U)-2 q2i ^- l] U-i)+f\ i - l] (j-2) 

end 

end 

with initial values /j(0) = 1 and /i(-l) = 0. The coefficients f 2 (i) are computed similarly by replacing $2M DV ?2i- 

Once the coefficients /i(0 and/2(0 are found, F\{z) and F 2 (z) are multiplied by 1 + z" 1 and 1 -z _1 , respectively, to 
obtain F[(z) and F^z); that is: 

f x (i) =/l(0 +/lO - - 1) = lv-,5 

^(0 = / 2 (0 - fid - i) /=i,...,5 . 

Finally the LP coefficients are computed from f[{i) and_^(/) by: 

f0.5/[(0 + 0.5^(0 i = 1,...,5 

ai=\ (26) 
[0.5/[(ll - 0 - 0.5^(11 ~ 0 i = 6,...,10 

This is directly derived from the relation A{z) = (Fjfe) + F' 2 {z))l2, and because Fj(z) and F' 2 (z) are symmetric and 
antisymmetric polynomials, respectively. 

3.3 Perceptual weighting 

The perceptual weighting filter is based on the unquantized LP filter coefficients a,-, and is given by: 

Mzlyi) i + £,V.^ 
."»"«./*)■ {27) 

The values of yj and y 2 determine the frequency response of the filter W(z). By proper adjustment of these variables it is 
possible to .make the weighting more effective. This is done by making Yi and y 2 a function of the spectral shape of the 
input signal. This adaptation is done once per 10 ms frame, but an interpolation procedure for each first subframe is used 
to smooth this adaptation process. The spectral shape is obtained from a 2nd order linear prediction filter, obtained as a 
by-product from the Levin son-Durbin recursion (3.2.2). The reflection coefficients kj are converted to Log Area Ratio 
(LAR) coefficients o x by: 

0i = lo g (LQ + *' } i=l,2 (28) 

(1.0 -to 
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The LAR coefficients corresponding to the current 10 ms frame are used for the second subframe. The LAR coefficients 
for the first subframe are obtained through linear interpolation with the LAR parameters from the previous frame. The 
interpolated LAR coefficients in each of the two subframes are given by: 

Subframe 1: <fp = 0.5 o^ revious ^ + 0.5 o\ curreni) i = 1,2 

' (29) 
Subframe 2: of ] = 0 \ current) i = 1,2 

The spectral envelope is characterized as being either flat (flat = 1) or tilted (flat-0). For each subframe this 
characterization is obtained by applying a threshold function to the LAR coefficients. To avoid rapid changes, a 
hysteresis is used by taking into account the value of flat in the previous subframe m - 1, 



flat^ - < 



0 if < -X.lAando^ > 0.65 andflat^' D = 1 

1 if (o^ > -1.52 or 4 m) < 0.43) and flat™- V = 0 (30) 



y7 fl f(w-l) otherwise 



If the interpolated spectrum for a subframe is classified as flat {flatW = 1), the weight factors are set to j\ = 0.94 and 
y 2 = 0.6. If the spectrum is classified as tilted (fla^ = 0), the value of Yi is set to 0.98, and the value of 72 is adapted to 
the strength of the resonances in the LP synthesis filter, but is bounded between 0.4 and 0.7. If a strong resonance is 
present, the value of 72 is set closer to the upper bound. This adaptation is achieved by a criterion based on the minimum 
distance between two successive LSP coefficients for the current subframe. The minimum distance is given by: 

dmin = m/n[G)/+i - 03/] i = 1,...,9 (31) 

The value of 72 is computed using the linear relationship: 

y 2 = -6.0d min + 1.0 bounded by 0.4 < 72 < 0.7 - (32) 

The weighted speech signal in a subframe is given by: 



10 10 

sw(n) = s(n) + X 0/7i f (* - 0 ~ X a^swin - i) n = 0,...,39 (33) 
i=\ 1=1 



The weighted speech signal sw(ri) is used to find an estimation of the pitch delay in the speech frame. 
3.4 Open-loop pitch analysis 

To reduce the complexity of the search for the best adaptive-codebook delay, the search range is limited around a 
candidate delay T op , obtained from an open-loop pitch analysis. This open-loop pitch analysis is done once per frame 
(10 ms). The open-loop pitch estimation uses the weighted speech signal sw(n) of Equation (33), and is done as follows: 
In the first step, three maxima of the correlation: 

79 

R(k) = X Mn)sw(n - k) (34) 
n=0 
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are found in the following three ranges: 



= 1: 80,...,143 
= 2: 40,...,79 
= 3: 20 39 



The retained maxima /?(/,), i = 1.....3, are normalized through: 



JTfo) = = i = 1,...,3 (35) 



The winner among the three normalized correlations is selected by favouring the delays with the values in the lower 
range. This is done by weighting the normalized correlations corresponding to the longer delays. The best open-loop 
delay T op is determined as follows: 

T op = *\ 
Wop) = 

ifR'(t 2 ) > OMWop) 
Wop) = R\t 2 ) 
Top = *2 

end 

ifR'itj) > O.S5R'(T op ) 
Wop) = R'(t 3 ) 
T op = r 3 

end 



This procedure of dividing the delay range into three sections and favouring the smaller values is used to avoid choosing 
pitch multiples. 

3.5 Computation of the impulse response 

The impulse response h(n) of the weighted synthesis filter W(z)/A(z) is needed for the search of adaptive and fixed 
codebooks. The impulse response h(n) is computed for each subframe by filtering a signal consisting of the coefficients 
of the filter A (z/y\) extended by zeros through the two filters \/A(z) and 1A4(z/y 2 ). 

3.6 Computation of the target signal 

The target signal x(n) for the adaptive-codebook search is usually computed by subtracting the zero-input response of the 
weighted synthesis filter W(z)/A(z) = A(z/y\)/[A(z)A(z/y2)] from the weighted speech signal sw(ai) of Equation (33). This 
is done on a subframe basis. 



An equivalent procedure for computing the target signal, which is used in this Recommendation, is the filtering of the LP 
residual signal r(n) through the combination of synthesis filter \IA(z) and the weighting filter ^(z/Yi)A4(z/Y2)- After 
determining the excitation for the subframe, the initial states of these filters are updated by filtering the difference 
between the residual and excitation signals. The memory update of these filters is explained in 3.10. 



16 Recommendation G.729 (03/96) 



The residual signal r(/i), which is needed for finding the target vector is also used in the adaptive-codebook search to 
extend the past excitation buffer. This simplifies the adaptive-codebook search procedure for delays less than the 
subframe size of 40 as will be explained in the next subclause. The LP residual is given by: 

10 ■ - 

r{n) = s(n) + X *P(n - 0 n = 0,...,39 (36) 



3.7 Adaptive-codebook search 

The adaptive-codebook parameters (or pitch parameters) are the delay and gain. In the adaptive-codebook approach for 
implementing the pitch filter, the excitation is repeated for delays less than the subframe length. In'the search stage, the 
excitation is extended by the LP residual to simplify the closed-loop search. The adaptive-codebook search is done every 
(5 ms) subframe. In the first subframe, a fractional pitch delay T\ is used with a resolution of 1/3 in the range of 

12 

[ 1°3 , 84^ ] and integers only in the range [85, 143]. For the second subframe, a delay T 2 with a resolution of 1/3 is 

2 2 

always used in the range int(T\) - 5j , int(T\) + 4j, where int(T\) is the integer part of the fractional pitch delay T { of the 
first subframe. This range is adapted for the cases where T\ straddles the boundaries of the delay range. 

For each subframe the optimal delay is determined using closed-loop analysis that minimizes the weighted mean-squared 
error. In the first subframe the delay T\ is found by searching a small range (six samples) of delay values around the 
open-loop delay T op (see 3.4). The search boundaries t min and are defined by: 

tmin ~ T 0 p — 3 

if tmi„ < 20 then t min = 20 

tmax ~ tmin + 6 

if tmax > 143 then 
tmax = 143 
tmin = tmax ~ 6 

end 

For the second subframe, closed-loop pitch analysis is done around the pitch selected in the first subframe to find the 

2 2 

optimal delay T 2 . The search boundaries are between t min - 3 and + 3, where t mi „ and are derived from T\ as 
follows: 

tmin = int(T\) -5 

if tmin < 20 then t min = 20 

tmax = t m i n + 9 . 

if tmax > 143 then 

tmax =143 
tmin = hnax ~ 9 

end 
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The closed-loop pitch search minimizes the mean-squared weighted error between the original and reconstructed speech. 
This is achieved by maximizing the term: 

PV(n)y*(n) 

R(k) = -=(£2 = (37) 



where x(n) is the target signal and y*(n) is the past filtered excitation at delay k [past excitation convolved with h(n)]. 
Note that the search range is limited around a preselected value, which is the open-loop pitch T op for the first subframe, 
and 7*! for the second subframe. 

The convolution y k (n) is computed for the delay t min . For the other integer delays in the search range k = t min + \ 9 ...j max , 
it is updated using the recursive relation: 

yk(n) = y M (n - 1) + u(-k)h(n) n = 39,...,0 (38) 



where u(n), n = -143, ...,39, is the excitation buffer, and yjt-i(-l) = 0. Note that in the search stage, the samples w(n), 
n = 0,...,39 are not known, and they are needed for pitch delays less than 40. To simplify the search, the LP residual is 
copied to u(n) to make the relation in Equation (38) valid for all delays. 

For the determination of Ti, and T\ if the optimum integer closed-loop delay is less than 85, the fractions around the 
optimum integer delay have to be tested. The fractional pitch search is done by interpolating the normalized correlation 
in Equation (37) and searching for its maximum. The interpolation is done using a FIR filter b\2 based on a Hamming 
windowed sine function with the sine truncated at ± 11 and padded with zeros at ± 12 (fci2(12) = 0). The filter has its 
cut-off frequency (-3 dB) at 3600 Hz in the oversampled domain. The interpolated values of R(k) for the fractions ~ 

- 1, 0, j and | are obtained using the interpolation formula 

3 3 
R(k) t = X #(* - i)b x2 (t + 3f) + £ JK* + 1 + 0^12(3 - t + 3i) t = 0, 1, 2 (39) 
r=0 r=0 



where / = 0, 1,2 corresponds to the fractions 0, | and |, respectively. Note that it is necessary to compute the correlation 
terms in Equation (37) using a range t min - 4, + 4, to allow for the proper interpolation. 

3.7.1 Generation of the adaptive-codebook vector 

Once the pitch delay has been determined, the adaptive-codebook vector v(n) is computed by interpolating the past 
excitation signal u(n) at the given integer delay k and fraction t: 

9 9 
v(n) =X«(n-H />3o(f + 3/) + X u(n - k + 1 + 0*30(3 - 1 + 3/) n = 0,...,39 t = 0, 1, 2 (40) 
i=0 /=0 



The interpolation filter £30 is based on a Hamming windowed sine functions truncated at ± 29 and padded with zeros at 
± 30 [J>3o(30) = 0]. The filter has a cut-off frequency (-3 dB) at 3600 Hz in the oversampled domain. 
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3.7.2 Codeword computation for adaptive- codebook delays 



The pitch delay T\ is encoded with 8 bits in the first subframe and the relative delay in the second subframe is encoded 
with 5 bits. A fractional delay T is represented by its integer part m/(7), and a fractional part frac/3,frac = -1 ,0,1. The 
pitch index PI is now encoded as: 



[3^1) - 19) +/roc - 1 // h = [19,...,85], frac = [-1,0, 1] 
PI = \ (41) 
[(intffi) - 85) + 197 if T } = [86,...,143], frac = 0 



The value of the pitch delay T 2 is encoded relative to the value of T\. Using the same interpretation as before, the 
fractional delay T 2 represented by its integer part int(r 2 ), and a fractional part frac/3, frac = -1,0,1, is encoded as: 



P2 = 3(mr(r 2 ) - t min ) + frac + 2 (42) 

where f m ; n is derived from T\ as in 3.7. 

To make the coder more robust against random bit errors, a parity bit P0 is computed on the delay index P\ of the first 
subframe. The parity bit is generated through an XOR operation on the six most significant bits of PI. At the decoder 
this parity bit is recomputed and if the recomputed value does not agree with the transmitted value, an error concealment 
procedure is applied. 

3.7.3 Computation of the adaptive-codebook gain 

Once the adaptive-codebook delay is determined, the adaptive-codebook gain g p is computed as: 

bounded by 0 < g p < 1.2 (43) 



I^ 0 y(n)y{n) 



where x(n) is the target signal and y(n) is the filtered adaptive-codebook vector [zero-state response of W(z)/A(z) to v(n)]. 
This vector is obtained by convolving v(n) with h(n): 



n 

y(n) = X v(i)h(n - i) n = 0,...,39 (44) 
i=0 



3.8 Fixed codebook - Structure and search 



The fixed codebook is based on an algebraic codebook structure using an Interleaved Single-Pulse Permutation (ISPP) 
design. In this codebook, each codebook vector contains four non zero pulses. Each pulse can have either the amplitudes 
+1 or -1, and can assume the positions given in Table 7. 
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TABLE 7/G.729 
Structure of Fixed codebook C 



Pulse 


Sign 


Positions 


h 




mo: 0,5, 10, 15,20, 25,30,35 


h 


±1 


m,: 1,6, 11, 16,21,26,31,36 


h 


s 2 : ±1 


m 2 : 2, 7, 12, 17, 22, 27,32, 37 


h 


s 3 : ±1 


m 3 : 3, 8,13, 18, 23, 28,33, 38 
4,9, 14, 19, 24, 29,34, 39 



The codebook vector c(n) is constructed by taking a zero vector of dimension 40, and putting the four unit pulses at the 
found locations, multiplied with their corresponding sign: 



c(n) = so5(n - mo) + s\8(n - m\) + - m?) + s^b(n - m$) n = 0,...,39 



(45) 



where 5(0) is a unit pulse. A special feature incorporated in the codebook is that the selected codebook vector is filtered 
through an adaptive pre-filter P(z) that enhances harmonic components to improve the quality of the reconstructed 
speech. Here the filter: 



P(z) = 1/(1 - Pz- 7 ) 



(46) 



is used, where T is the integer component of the pitch delay of the current subframe, and p is a pitch gain. The value of P 
is made adaptive by using the quantized adaptive-codebook gain from the previous subframe, that is: 



P _ |^m-l) bounded by 0.2 < p < 0.8 



(47) 



For delays less than 40, the codebook c(n) of Equation (45) is modified according to: 



On) = 



| c(n) n = 0,.. M r - 1 

[c(n) + pc(n - T) n = 7,.. .,39 



(48) 



This modification is incorporated in the fixed-codebook search by modifying the impulse response h{n) according to: 



h{n) = 



\h(n) n = 0,...,7 - 1 

[h(n) + p/i(n - T) n = T,...,39 



(49) 



3.8.1 Fixed-codebook search procedure 

The fixed codebook is searched by minimizing the mean-squared error between the weighted input speech sw(n) of 
Equation (33) and the weighted reconstructed speech. The target signal used in the closed-loop pitch search is updated 
by subtracting the adaptive-codebook contribution. That is: 



x'in) = x{n) -gpjtn) n = 0,...,39 



(50) 



where y(ri) is the filtered adaptive-codebook vector of Equation (44) and g p the adaptive-codebook gain of 
Equation (43). 
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The matrix H is defined as the lower triangular Toepliz convolution matrix with diagonal h(0) and lower diagonal 
/i(l),...,/i(39). The matrix <J> = IfH contains the correlations of h(n) y and the elements of this symmetric matrix are given 
by: 

39 . * 
<K'J) = X Kn - i)h(n -j) i = 0,...,39 j = i,...,39 (51) 



The correlation signal d(n) is obtained from the target signal x*(n) and the impulse response h{n) by: 



39 

d(n) = X ^(0*(f - ») » = 0.....39 (52) 



If is the hh fixed-codebook vector, then the codebook is search by maximizing the term: 

2 (Ll 9 =o d ^ c ^ 2 



(53) 



where t denotes transpose. 

The signal d{n) and the matrix O are computed before the codebook search. Note that only the elements actually needed 
are computed and an efficient storage procedure has been designed to speed up the search procedure. 

The algebraic structure of the codebook C allows for a fast search procedure since the codebook vector c* contains only 
four non zero pulses. The correlation in the numerator of Equation (53) for a given vector is given by: 

3 

C = X sjUmd (54) 
; = 0 . 



where m,- is the position of the /th pulse and Sj is its amplitude. The energy in the denominator of Equation (53) is given 
by: 

3 2 3 

E = X m i) + 2 X X Sisfiinti, raj) (55) 
/=0 i=0 ;=/'+ 1 



To simplify the search procedure, the pulse amplitudes are predetermined by quantizing the signal d(n). This is done by 
setting the amplitude of a pulse at a certain position equal to the sign of d(n) at the position. Before the codebook search, 
the following steps are done. First, the signal d(ri) is decomposed into two parts: its absolute value \d(n)\ and its sign sign 
[d(n)]. Second, the matrix O is modified by including the sign information; that is, 

mj) = sign [</(/)] sign [d(j)MiJ) i = 0,...,39 j = i + 1,...,39 (56) 

The main-diagonal elements of O are scaled to remove the factor 2 in Equation (55) 

<j>'(/,0 = 0.5*U0 i = 0,...,39 (57) 
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The correlation in Equation (54) is now given by: 

C = \d{,no)\ + \d(m X )\ + \d{m 2 )\ + \d(m 3 )\ 



(58) 



and the energy in Equation (55) ii 

Ell = 
+ 
+ 
+ 



A focused search approach is used to further simplify the search procedure. In this approach a precomputed threshold is 
tested before entering the last loop, and the loop is entered only if this threshold is exceeded. The maximum number of 
times the loop can be entered is fixed so that a low percentage of the codebook is searched. The threshold is computed 
based on the correlation C. The maximum absolute correlation and the average correlation due to the contribution of the 
first three pulses, max^ and av^ are found before the codebook search. The threshold is given by: 

thr?> = av$ + K^max^ - avy) (60) 

The fourth loop is entered only if the absolute correlation (due to three pulses) exceeds thr$, where 0 < K$ < 1. The value 
of K$ controls the percentage of codebook search and it is set here to 0.4. Note that mis results in a variable search time. 
To further control the search the number of times the last loop is entered (for the two subframes) cannot exceed a certain 
maximum, which is set here to 180 (the average worst case per subframe is 90 times). 

3.8.2 Codeword computation of the fixed codebook 

The pulse positions of the pulses i 0 , i\ and /*2, are encoded with 3 bits each, while the position of i3 is encoded with 4 
bits. Each pulse amplitude is encoded with 1 bit This gives a total of 17 bits for the 4 pulses. By defining s = 1 if the 
sign is positive and s = 0 if the sign is negative, the sign codeword is obtained from: 

S = 50 + 2*i + 4*2 + 853 (61) 

and the fixed-codebook codeword is obtained from: 

C = (mo/5) + 8(m!/5) + 64(m 2 /5) + 512(2(^5) + jx) (62) 

where jx = 0 if m 3 = 3,8,...,38, and jx = 1 if m 3 = 4,9...,39. 
3.9 Quantization of the gains 

The adaptive-codebook gain (pitch gain) and the fixed-codebook gain are vector quantized using 7 bits. The gain 
codebook search is done by minimizing the mean-squared weighted error between original and reconstructed speech 
which is given by: 

E = x*x + g^yty + ^z'z - Igp'y - Ig^z + Igpg^z (63) 

where x is the target vector (see 3.6), y is the filtered adaptive-codebook vector of Equation (44), and z is the fixed- 
codebook vector convolved with /i(/i), 

n 

z(n) = X <*0Kn - /) n = 0,...,39 (64) 
i=0 



s given by: 
<t>'(mo, mo) 

<b'(m\,m\) + <|>'(mo,/ni) 

<|>'(/n2, m 2) + <t>'('wo>"*2) + ty'Onumi) 

<t>'0»3i *"3) + V(ntQ,mj) + $'(m h mi) + <|>'(/n2. m l) 



(59) 
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3.9.1 Gain prediction 

The fixed-codebook gain g c can be expressed as: 



8c = Y£ (65) 

where g' c is a predicted gain based on previous fixed-codebook energies, and y is a correction factor. 
The mean energy of the fixed-codebook.contribution is given by: 



( 1 39 



£= 10 log I <WH W 



After scaling the vector c(n) with the fixed-codebook gain g c , the energy of the scaled fixed codebook is given by 20 log 
g c + E. Let £^ m) be the mean-removed energy (in dB) of the (scaled) fixed-codebook contribution at the subframe m, 
given by: 



£< m ) = 20 log g c + E - E (67) 



where E = 30 dB is the mean energy of the fixed-codebook excitation. The gain g c can be expressed as a function of 
£< m >,£and£by: 



gc = iflCfi^+E-E)/* (68) 



The predicted gain g' c is found by predicting the log-energy of the current fixed-codebook contribution from the log- 
energy of previous fixed-codebook contributions. The 4th order MA prediction is done as follows. The predicted energy 
is given by: 

4 

£(m) = £fc f 0(m-i) (69) 



where [b\ bi b$] = [0.68 0.58 0.34 0.19] are the MA prediction coefficients, and £K m) is the quantized version of the 
prediction error lA m ) at subframe m, defined by: 

jjitn) - £(m) _ £{m) (70) 



The predicted gain g' c is found by replacing by its predicted value in Equation (68). 

g ' c = i 0 (^ m) +£-£)/20 (71) 

The correction factor y is related to the gain-prediction error by: 

Uim) = £<m) _ £(/n) = 2 Q log(y) (72) 
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3.9.2 Codebook search for gain quantization 



The adaptive-codebook gain, g p , and the factor y are vector quantized using a two-stage conjugate structured codebook. 
The first stage consists of a 3 bit two-dimensional codebook bA> and the second stage consists of a 4 bit two-dimensional 
codebook b(S. The first element in each codebook represents the quantized adaptive-codebook gain gp and the second 
element represents the quantized fixed-codebook gain correction factor y. Given codebook indices GA and GB for bA 
and M, respectively, the quantized adaptive-codebook gain is given by: 

g p = M\(GA) + Mi(GB) (73) 



and the quantized fixed-codebook gain by: 

gc = g' c y = g' c (bA 2 (GA) + M 2 (GB)) . (74) 

This conjugate structure simplifies the codebook search, by applying a pre-selection process. The optimum pitch gain g p , 
and fixed-codebook gain, g c , are derived from Equation (63), and are used for the pre-selection. The codebook bA 
contains eight entries in which the second element (corresponding to g c ) has in general larger values than the first 
element (corresponding to g p ). This bias allows a pre-selection using the value of g c . In this pre-selection process, a 
cluster of four vectors whose second elements are close to g c are selected. Similarly, the codebook b(E contains 16 entries 
in which each has a bias towards the first element (corresponding to g p ). A cluster of eight vectors whose first elements 
are close to g p are selected. Hence for each codebook the best 50% candidate vectors are selected. This is followed by an 
exhaustive search over the remaining 4 x 8 = 32 possibilities, such that the combination of the two indices minimizes the 
weighted mean -squared error of Equation (63). 

3.9.3 Codeword computation for gain quantizer 

The codewords GA and GB for the gain quantizer are obtained from the indices corresponding to the best choice. To 
reduce the impact of single bit errors the codebook indices are mapped. 

3.10 Memory update 

An update of the states of the synthesis and weighting filters is needed to compute the target signal in the next subframe. 
After the two gains are quantized, the excitation signal, u(n), in the present subframe is obtained using: 

u(n) = g p v(n) + g c c(n) n = 0,...,39 . (75) 

where g p and g c are the quantized adaptive and fixed-codebook gains, respectively, v(n) is the adaptive-codebook vector 
(interpolated past excitation), and c(/i) is the fixed-codebook vector including harmonic enhancement. The states of the 
filters can be updated by filtering the signal r(n)-u(n) (difference between residual and excitation) through the filters 
\/A(z) and A(z/y\)/A(z/y2) for the 40 sample subframe and saving the states of the filters. This would require three filter 
operations. A simpler approach, which requires only one filter operation, is as follows. The locally reconstructed speech 
s(n) is computed by filtering the excitation signal through \IA(z). The output of the filter due to the input r(n) - u(n) is 
equivalent to e(n) = s(n) - S(n). So the states of the synthesis filter \IA{z) are given by e(n\ n = 30,.. .,39. Updating the 
states of the filter A{zh(\)IA (zlyi) can be done by filtering the error signal e(n) through this filter to find the perceptually 
weighted error ew(n). However, the signal ew(n) can be equivalently found by: 

ew(n) = x(n) - g p y(n) - g c z(n) (76) 

Since the signals x(n), y(n) and z(n) are available, the states of the weighting filter are updated by computing ew(n) as in 
Equation (76) for n = 30,. ..,39. This saves two filter operations. 
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4 Functional description of the decoder 



The principle of the decoder was shown in clause 2 (Figure 3). First the parameters are decoded (LP coefficients, 
adaptive-codebook vector, fixed-codebook vector and gains). The transmitted parameters are listed in Table 8. These 
decoded parameters are used to compute the reconstructed speech signal as will be described in 4.1. This reconstructed 
signal is enhanced by a post-processing operation consisting of a postfilter, a high-pass filter and an upscaling (see 4.2). 
Subclause 4.4 describes the error concealment procedure used when either a parity error has occurred, or when the frame 
erasure flag has been set A detailed signal flow diagram of the decoder is shown in Figure 6. 



TABLE 8/G.729 

Description of transmitted parameters indices - The bitstream ordering is reflected by the order 
in the table - For each parameter the Most Significant Bit (MSB) is transmitted first 



Symbol 


Description 


Bits 


LO 


Switched MA predictor of LSP quantizer 


1 


LI 


First stage vector of quantizer 


7 


L2 


Second stage lower vector of LSP quantizer 


5 


L3 


Second stage higher vector of LSP quantizer 


5 




Pitch delay first subframe 


8 


PO 


Parity bit for pitch delay 


1 


CI 


Fixed codebook first subframe 


13 


S\ 


Signs of fixed-codebook pulses 1st subframe 


4 


GA\ 


Gain codebook (stage 1) 1st subframe 


3 


GB\ 


Gain codebook (stage 2) 1st subframe 


4 


P2 


Pitch delay second subframe 


5 


C2 


Fixed codebook 2nd subframe 


13 


S2 


Signs of fixed-codebook pulses 2nd subframe 


4 


GA2 


Gain codebook (stage 1) 2nd subframe 


3 


GB2 


Gain codebook (stage 2) 2nd subframe 


4 



4.1 Parameter decoding procedure 

The decoding process is done in the following order. 
4.1.1 Decoding of LP filter parameters 

The received indices LO, LI , L2 and L3 of the LSP quantizer are used to reconstruct the quantized LSP coefficients using 
the procedure described in 3.2.4. The interpolation procedure described in 3.2.5 is used to obtain two sets of interpolated 
LSP coefficients (corresponding to two subframes). For each subframe, the interpolated LSP coefficients are converted 
to LP filter coefficients a,-, which are used for synthesizing the reconstructed speech in the subframe. 

The following steps are repeated for each subframe: 

1 ) decoding of the adaptive-codebook vector; 

2) decoding of the fixed-codebook vector; 

3) decoding of the adaptive and fixed-codebook gains; 

4) computation of the reconstructed speech. 
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FIGURE 6/G.729 
Signal now at the CS-ACELP decoder 



4.1.2 Computation of the parity bit 

Before the excitation is reconstructed, the parity bit is recomputed from the adaptive-codebook delay index PI 
(see 3.7.2). If this bit is not identical to the transmitted parity bit P0, it is likely that bit errors occurred during 
transmission. 

If a parity error occurs on PI, the delay value T\ is set to the integer part of the delay value T2 of the previous frame. The 
value 7*2 is derived with the procedure outlined in 4.1.3, using this new value of T\. 
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4.1.3 Decoding of the adaptive-cod ebook vector 



If no parity error has occurred the received adaptive-codebook index PI is used to find the integer and fractional parts of 
the pitch delay T\. The integer part mr^i) and fractional part frac of T\ are obtained from PI as follows: 

if PI < 197 

int(T x ) = (PI + 2)/3 + 19 
frac = PI - 3 int(T\) + 58 

else 

int(T\) = PI - 112 
frac = 0 

end 

The integer and fractional part of T 2 are obtained from P2 and t m j n , where t min is derived from T\ as follows: 

tmin = int(T\) - 5 

if tmin < 20 f/ien t min = 20 

'/'max > 143 f/iert 
^max = 143 
tmin ~ tniax ~~ ^ 

Now ^2 is decoded using: 

inKT 2 ) = (P2 + 2)/3 - 1 + t min 
frac = P2 - 2 - 3 ((P2 + 2)/3 - 1) 

The adaptive-codebook vector v(n) is found by interpolating the past excitation u(n) (at the pitch delay) using 
Equation (40). 

4.1.4 Decoding of the fixed-codebook vector 

The received fixed-codebook index C is used to extract the positions of the excitation pulses. The pulse signs are 
obtained from S. This is done by reversing the process described in 3.8.2. Once the pulse positions and signs are decoded 
the fixed-codebook vector c(n) is constructed using Equation (45). If the integer part of the pitch delay T is less than the 
subframe size 40, c(n) is modified according to Equation (48). 

4.1.5 Decoding of the adaptive and fixed-codebook gains 

The received gain-codebook index gives the adaptive-codebook gain g p and the fixed-codebook gain correction factor y. 
This procedure is described in detail in 3.9. The estimated fixed-codebook gain g' c is found using Equation (71). The 
fixed-codebook vector is obtained from the product of the quantized gain correction factor with this predicted gain 
Equation (74). The adaptive-codebook gain is reconstructed using Equation (73). 
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4.1.6 Computing the reconstructed speech 

The excitation u(n) [see Equation (75)] is input to the LP synthesis filter. The reconstructed speech for the subframe is 
given by: 

10 

s(n) = u(n) - X &iHn - 0 n = 0,...,39 (77) 
i= 1 

where 0, are the interpolated LP filter coefficients for the current subframe. The reconstructed speech s(n) is then 
processed by the post processor described in the next subclause. 

4.2 Post-processing 

Post-processing consists of three functions: adaptive postfiltering, high-pass filtering and signal upscaling. The adaptive 
postfilter is the cascade of three filters: a long-term postfilter H p (z)> a short-term postfilter Hf(z) and a tilt compensation 
filter H t (z), followed by an adaptive gain control procedure. The postfilter coefficients are updated every 5 ms subframe. 
The postfiltering process is organized as follows. First, the reconstructed speech J(/?) is inverse filtered through A(z/Y n ) 
to produce the residual signal r(w). This signal is used to compute the delay T and gain g t of the long-term postfilter. 
H p (z). The signal r(w) is then filtered through the long-term postfilter H p (z) and the synthesis filter l/[g/A(z/y</)]. Finally, 
the output signal of the synthesis filter l/[g/A(z/fa)] is passed through the tilt compensation filter H t (z) to generate the 
postfiltered reconstructed speech signal sj{n). Adaptive gain control is then applied to sfin) to match the energy of s(n). 
The resulting signal sf\n) is high-pass filtered and scaled to produce the output signal of the decoder. 

4.2.1 Long-term postfilter 

The long-term postfilter is given by: 

H P (Z) =TT^(1 + IpSIZ- 7 ) (78) 

where T is the pitch delay, and g/ is the gain coefficient. Note that g/ is bounded by 1, and it is set to zero if the long-term 
prediction gain is less than 3 dB. The factor y p controls the amount of long-term postfiltering and has the value of 
Y p = 0.5. The long-term delay and gain are computed from the residual signal r(n) obtained by filtering the speech s(n) 
through A(s/y„), which is the numerator of the short-term postfilter (see 4.2.2). 

10 

Hn) = S{n) + " 0 (79) 

i = l 



The long-term delay is computed using a two-pass procedure. The first pass selects the best integer Tq in the range 
[intiJx) - 1, int(T\) +1], where int(T x ) is the integer part of the (transmitted) pitch delay T x in the first subframe. The 
best integer delay is the one that maximizes the correlation. 

39 

R(k) = X Hn)r(n - k) (80) 
n=0 



The second pass chooses the best fractional delay T with resolution 1/8 around Tq. This is done by finding the delay with 
the highest pseudo-normalized correlauon 
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where r^n) is the residual signal at delay L Once the optimal delay T is found, the corresponding correlation R'(T) is 
normalized with the square-root of the energy of r(n). The squared value of this normalized correlation is used to 
determine if the long-term postfilter should be disabled. This is done by setting g/ = 0 if: 

R'(T)^ 

V39 V-^-x < 0.5 (82) 



-n = 0 



r(n)?(n) 



Otherwise the value of g\ is computed from: 



Xf-o ***** 

81 = E 39 fonyfon) b ° Unded by ° " 81 ~ 10 (83) 



The non-integer delayed signal r k (n) is first computed using an interpolation filter of length 33. After the selection of T, 
r k (n) is recomputed with a longer interpolation filter of length 129. The new signal replaces the previous one only if the 
longer filter increases the value of R'(T). 

4.2.2 Short-term postfilter 

The short-term postfilter is given by: 

"i® " . gf a (z i -u) - «/ 1 + x;» rM' ' 



where A(z) is the received quantized LP inverse filter (LP analysis is not done at the decoder) and the factors y„ and y d 
control the amount of short-term postfiltering, and are set to y n = 0.55, and y d - 0.7. The gain term gf is calculated on the 
truncated impulse response hfin) of the filter A (z/y n )IA(z/yd) and is given by: 

19 

n=0 



4.23 Tilt compensation 

The filter H t (z) compensates for the tilt in the short-term postfilter Hj{z) and is given by: 

H,(z) = j f (1 + Y^z" 1 ) (86) 

where y t k\ is a tilt factor /:{ being the first reflection coefficient calculated from hj{n) with 

19-/ 

*l = ^ = ^ ^ ^ + 0 (87) 



The gain term g, = 1 - (v,A:j| compensates for the decreasing effect of gf in Hfiz). Furthermore, it has been shown that the 
product filter Hj(z)Hfe) has generally no gain. Two values for y t are used depending on the sign of k\. If k\ is negative, 
y, = 0.9, and if k\ is positive, y t - 0.2. 
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4.2.4 Adaptive gain controll 



Adaptive gain control is used to compensate for gain differences between the reconstructed speech signal s(n) and the 
postfiltered signal sj{n). The gain scaling factor G for the present subframe is computed by: 



G = 



I» 0 IM 



I» 0 k/Wl 



(88) 



The gain-scaled postfiltered signal sf(ri) is given by: 

sf(n) = gp>sf(n) n = 0.....39 

where is updated on a sample-by-sample basis and given by: 

= 0.85 + 0.15 G n = 0,...,39 



(89) 



(90) 



The initial value of g (_1) = 1.0 is used. Then for each new subframe, is set equal to g (39) of the previous subframe. 
4.2.5 High-pass filtering and upscaling 

A high-pass filter with a cut-off frequency of 100 Hz is applied to the reconstructed postfiltered speech sf(n). The filter 
is given by: 



H h 2(z) = 



0.93980581 - 1.8795834Z"* 1 + 0.9398058 W 1 



1 - 1.9330735*- 1 + 0.93589199z~ 2 

The filtered signal is multiplied by a factor 2 to restore the input signal level. 
4.3 Encoder and decoder initialization 

All static encoder and decoder variables should be initialized to 0, except the variables listed in Table 9. ' 



(91) 



TABLE 9/G.729 
Description of parameters with non-zero initialization 



Variable 


Reference 


Initial value 


p 


3.8 


0.8 




4.2.4 


1.0 


% 


3.2.4 


ZTT/ll 


m 


3.2.4 


arccos(m/ll) 


0w 


3.9.1 


-14 



4.4 Concealment of frame erasures 

An error concealment procedure has been incorporated in the decoder to reduce the degradation in the reconstructed 
speech because of frame erasures in the bitstream. This error concealment process is functional when the frame of coder 
parameters (corresponding to a 10 ms frame) has been identified as being erased. The mechanism for detecting frame 
erasures is not defined in the Recommendation, and will depend on the application. 
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The concealment strategy has to reconstruct the current frame, based on previously received information. The method 
replaces the missing excitation signal with one of similar characteristics, while gradually decaying its energy. This is 
done by using a voicing classifier based on the long-term prediction gain, which is computed as part of the long-term 
postfilter analysis. TKe long-term postfilter (see 4.2.1) finds the long-term predictor for which the prediction gain is more 
than 3 dB. This is done by setting a threshold of 0.5 on the squared normalized correlation of (Equation 82). For the 
error concealment process, a 10 ms frame is declared periodic if at least one 5 ms subframe has a long-term prediction 
gain of more than 3 dB. Otherwise the frame is declared non-periodic. An erased frame inherits its class from the 
preceding (reconstructed) speech frame. Note that the voicing classification is continuously updated based on this 
reconstructed speech signal. 

The specific steps taken for an erased frame are: 

1 ) repetition of the synthesis filter parameters; 

2) attenuation of adaptive and fixed-codebook gains; 

3) attenuation of the memory of the gain predictor; 

4) generation of the replacement excitation. 

4.4.1 Repetition of synthesis filter parameters 

The synthesis filter in an erased frame uses the LP parameters of the last good frame. The memory of the MA LSF 
predictor contains the values of the received codewords /,-. Since the codeword is not available for the current frame m, it 
is computed from the repeated LSF parameters to, and the predictor memory using: 









f 


4 ^ 


fc- 




1 


1 


- XA,* 








I 


fc-1 ) 



where the MA predictor coefficients p t k are those of the last received good frame. 
4.4.2 Attenuation of adaptive and fixed-codebook gains 

The fixed-codebook gain is based on an attenuated version of the previous fixed- codebook gain and is given by: 

g W = 0.984 m ~ 1} (93) 



where in is the subframe index. The adaptive-codebook gain is based on an attenuated version of the previous adaptive- 
codebook gain and is given by: 



g (m) = o.9g£ m " 0 bounded by g< m) < 0.9 



(94) 



4.43 Attenuation of the memory of the gain predictor . 

As was described in 3.9 the gain predictor uses the energy of previously selected fixed-codebook vectors c(n). To avoid 
transitional effects at the decoder, once good frames are received, the memory of the gain predictor is updated with an 
attenuated version of the codebook energy. The value of for the current subframe m is set to the averaged quantized 
gain prediction-error, attenuated by 4 dB: 



0O») = 



4 \ 
0.25 X tf (m ~° 

w J 



- 4.0 bounded by 0(") > -14 



(95) 
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4.4.4 Generation of the replacement excitation 



The excitation used depends on the periodicity classification. If the last reconstructed frame was classified as periodic, 
the current frame is considered to be periodic as well. In that case only the adaptive codebook is used, and the fixed- 
codebook contribution is set to zero.. The pitch delay is based on the integer part of the pitch delay in the previous frame, 
and is repeated for each successive frame. To avoid excessive periodicity the delay is increased by one for each next 
subframe but bounded by 143. The adaptive- codebook gain is based on an attenuated value according to Equation (94). 

If the last reconstructed frame was classified as non-periodic, the current frame is considered to be non-periodic as well, 
and the adaptive-codebook contribution is set to zero. The fixed-codebook contribution is generated by randomly 
selecting a codebook index and sign index. The random generator is based on the function 

seed =31821 seed + 13849 (96) 



with the initial seed value of 21845. The fixed-codebook index is derived from the 13 least significant bits of the next 
random number. The fixed-codebook sign is derived from the 4 least significant bits of the next random number. The 
fixed-codebook gain is attenuated according to Equation (93). 



5 Bit-exact description of the CS-ACELP coder 

ANSI C code simulating the CS-ACELP coder in 16 bit fixed-point is available from ITU-T. The following subclauses 
summarize the use of this simulation code, and how the software is organized. 

5.1 Use of the simulation software 

The C code consists of two main programs coder.c, which simulates the encoder, and decoder.c, which simulates the 
decoder. The encoder is run as follows: 

coder inputfile bitstreamfile 

The input file and output file are sampled data files containing 16 bit PCM signals. The decoder is run as follows: 

decoder bitstreamfile outputfile 
The mapping table of the encoded bitstream is contained in the simulation software. 

5.2 Organization of the simulation software 

In the fixed-point ANSI C simulation, only two types of fixed-point data are used as is shown in Table 10. To facilitate 
the implementation of the simulation code, loop indices, Boolean values and flags use the type Flag, which would be 
either 16 bits or 32 bits depending on the target platform. 



TABLE 10/G.729 
Data types used in ANSI C simulation 



Type 


Maximal value 


Minimal value 


Description 


Word 16 


0x7fff 


0x8000 


Signed 2's complement 16-bit word 


Word32 


0x7fffffffL 


0x80000000L 


Signed 2's complement 32-bit word 
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All the computations are done using a predefined set of basic operators. The description of these operators is given in 
Table 11. The tables used by the simulation coder are summarized in Table 12. These main programmes use a library of 
routines that are summarized in Tables 13, 14 and 15. 



TABLE 11/G.729 
Basic operations used in ANSI C simulation 



Ooeration 


Descrintion 


Wordlo sature(wora3z Levari ) 


Limit to 1 0 DltS 


Word 1 o add( word 1 o var l , word 1 o varZ) 


Short addition 


Wordlo sub(Wordlo varl, Wordlo varz) 


Short subtraction 


Word 1 6 abs_s( Word 1 o var 1 ) 


Short absolute value 


Word 1 o sn 1 (Word 1 o var 1 , Word 1 o var2) 


5>nort shirt left 


Wordlo shi\Wordlo varl, Wordlo varz) 


_>nort snitt ngnt 


woroio muiti wora 10 var i, woroio varz) 


Short multiplication 


Word J 2 L_mult( Wordlo varl, wordlo varz) 


Long multiplication 


Word 1 o negate( Word 1 o varl ) 


Short negate 


Word 1 o extract_n( WordJz L_var 1 ) 


Extract high 


Word 1 o extract_ 1 ( Word Jz L_var 1 ) 


Extract low 


Word 1 6 round(Word32 L_var 1 ) 


Round 


Word32 L_mac(Word32 L_var3, Wordlo varl, Wordlo varz) 


Multiply and accumulate 


WordJZ L_msu(WordJz L_varJ, Wordlo varl, Wordlo varZ) 


Multiply and subtract 


Word32 L_add(Word32 L_varl, Word32 L_var2) 


Long addition 


Word32 L_sub(Word32 L_varl, Word32 L_var2) 


Long subtraction 


\\7 f) — ; » _„ »„7A\;~ 1 T"> I ..„_ 1 \ 

Word32 L_negate(Word32 L_varl) 


Long negate 


WI J1 / 1- SW 7 11/" 1 11 7 11/ /N\ 

Word 1 6 mult_r( Word 1 6 var 1 , Word 1 6 var2) 


Multiplication with rounding 


\\J 1 — H» T _ 1 t /\1 7 111 I 1 \\ 7 11/ o\ 

Word32 L_shl(Word32 Levari, Wordlo var2) 


I I_.1V 

Long shift left 


w ora jz i__snr\ w ora j z i__var i , w oro i o varz ) 


Long shift right 


Word 1 6 shr_r( Word 1 6 var 1 , Word 1 6 var2) 


Shift right.with rounding . 


Wordl6 mac_r(Word32 L_var3, Wordl6 varl, Wordl6 var2) 


Mac with rounding 


Word 16 msu_r(Word32 L_var3, Word 16 varl, Word 16 var2) 


Msu with rounding 


Word32 L_deposit_h(Wordl6 varl) 


16-bit varl into MSB part 


Word32 L_deposit_l( Word 1 6 var 1 ) 


16-bit varl into LSB part 


Word32 L_shr_r(Word32 Levari, Word 16 var2) 


Long shift right with round 


Word32 L_abs(Word32 L.varl) 


Long absolute value . 


Word 1 6 norm_s( Word 1 6 var 1 ) 


Short norm 


Wordl6div_s(Wordl6 varl, Wordl6 var2) 


Short division 


Word 16 norm_l(Word32 L_varl) 


Long norm 
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TABLE 12/G.729 
Summary of tables fond in tab. Id8.c 



Table name 


bize 


Description 


tab_hup_s 


28 


Upsampling filter for postfilter 


tab_hup_l 


112 


Upsampling filter for postfilter 


inter_3 


13 


FIR filter for interpolating the correlation 


inter_3 


31 


FIR filter for interpolating past excitation 


Ispcbl 


128 x 10 


LSP quantizer (first stage) 


Ispcb2 


32 x 10 


LSP quantizer (second stage) 


f g 


2x4x 10 


MA predictors in LSP VQ 


fg_sum 


2x 10 


Used in LSP VQ 


fg_sum_inv 


2x 10 


Used in LSP VQ 


gbkl 


8x2 


Codebook GA in gain VQ 


gbk2 


16x2 


Codebook GB in gain VQ 


mapl 


8 


Used in gain VQ 


imapl 


8 


Used in gain VQ 


map2 


16 


Used in gain VQ 


ima21 


16 


Used in gain VQ 


window 


240 


LP analysis window 


lagji 


10 


Lag window for bandwidth expansion (high part) 


lag_l 


10 


Lag window for bandwidth expansion (low part) 


grid 


61 


Grid points in LP to LSP conversion 


tabsqr 


49 


Lookup table in inverse square root computation 


tablog 


33 


Lookup table in base 2 logarithm computation 


table 


65 


Lookup table in LSF to LSP conversion and vice versa 


slope 


64 


Line slopes in LSP to LSF conversion 


tabpow 


33 


Lookup table in 2 X computation j 
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Summary of encoder specific routines 



Filename 


Description 


acelp_co.c 

cod_ld8k.c 

Ipc.c 

pitch.c 

pre_proc.c 

pwf.c 

qua_gain.c 

qua_lsp.c 


Search fixed codebook 
Encoder routine 
LP analysis 
Pitch search 

Pre-processing (HP filtering and scaling) 
Computation of perceptual weighting coefficients 
Gain quantizer 
LSP quantizer 
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TABLE 14/G.729 
Summary of decoder specific routines 



Filename 


Description 


de_acelp.c 


Decode algebraic codebook 


dec_gain.c 


Decode gains 


dec_lag3.c 


Decode adaptive-codebook index 


decjd8k.c 


Decoder routine 


lspdec.c 


LSP decoding routing 


post_pro.c 


Post processing (HP filtering and scaling) 


pst.c 


Postfilter routines 




TARI P 1 S/n 790 
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Summary of general routines 


Filename 


Description 


basicop2.c 


Basic operators 


oper_32bx 


Extended basic operators 


bits.c 


Bit manipulation routines 


dspfuncx 


Mathematical functions 


filter.c 


Filter functions 


gainpred.c 


Gain predictor 


lpcfunc.c 


Miscellaneous routines related to LP filter 


lspgetqx 


LSP quantizer . 


p_parityx 


Compute pitch parity ■ 


pred_lt3.c 


Generation of adaptive codebook 


utile 


Utility functions 
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